Learning surrogate models for simulation-based optimization
نویسندگان
چکیده
We address a central problem in modeling, namely that of learning an algebraic model from data obtained from simulations or experiments. We propose a methodology that uses a small number of simulations or experiments to learn models that are as accurate and as simple as possible. The approach begins by building a low-complexity surrogate model. The model is built using a best subset technique that leverages an integer programming formulation to allow for the efficient consideration of a large number of possible functional components in the model. The model is then improved systematically through the use of derivative-free optimization solvers to adaptively sample new simulation or experimental points. We describe ALAMO, the computational implementation of the proposed methodology, along with examples and extensive computational comparisons between ALAMO and a variety of machine learning techniques, including Latin hypercube sampling, simple least squares regression, and the lasso.
منابع مشابه
Pareto-Based Multi-output Metamodeling with Active Learning
When dealing with computationally expensive simulation codes or process measurement data, global surrogate modeling methods are firmly established as facilitators for design space exploration, sensitivity analysis, visualization and optimization. Popular surrogate model types include neural networks, support vector machines, and splines. In addition, the cost of each simulation mandates the use...
متن کاملA Surrogate Modeling and Adaptive Sampling Toolbox for Computer Based Design
An exceedingly large number of scientific and engineering fields are confronted with the need for computer simulations to study complex, real world phenomena or solve challenging design problems. However, due to the computational cost of these high fidelity simulations, the use of neural networks, kernel methods, and other surrogate modeling techniques have become indispensable. Surrogate model...
متن کاملSpatiotemporal Estimation of PM2.5 Concentration Using Remotely Sensed Data, Machine Learning, and Optimization Algorithms
PM 2.5 (particles <2.5 μm in aerodynamic diameter) can be measured by ground station data in urban areas, but the number of these stations and their geographical coverage is limited. Therefore, these data are not adequate for calculating concentrations of Pm2.5 over a large urban area. This study aims to use Aerosol Optical Depth (AOD) satellite images and meteorological data from 2014 to 2017 ...
متن کاملAdaptive Simulation-based Training of AI Decision-makers using Bayesian Optimization
This work studies how an AI-controlled dog-fighting agent with tunable decisionmaking parameters can learn to optimize performance against an intelligent adversary, as measured by a stochastic objective function evaluated on simulated combat engagements. Gaussian process Bayesian optimization (GPBO) techniques are developed to automatically learn global Gaussian Process (GP) surrogate models, w...
متن کاملA Survey of Model-based Methods for Global Optimization
This article describes model-based methods for global optimization. After introducing the global optimization framework, modeling approaches for stochastic algorithms are presented. We differentiate between models that use a distribution and models that use an explicit surrogate model. Fundamental aspects of and recent advances in surrogate-model based optimization are discussed. Strategies for...
متن کامل